Survey of duplicate detection using progressive detection techniques
نویسندگان
چکیده
منابع مشابه
A Study of Progressive Techniques for Efficient Duplicate Detection
---Databases contains very large datasets, where various duplicate records are present. The duplicate records occur when data entries are stored in a uniform manner in the database, resolving the structural heterogeneity problem. Detection of duplicate records are difficult to find and it take more execution time. In this literature survey papers various techniques used to find duplicate record...
متن کاملImplication of Clone Detection and Refactoring Techniques using Delayed Duplicate Detection Refactoring
Code maintenance has been increased when the similar code fragments is reduced in the software systems. Refactoring is a change made to the internal structure of software to make it easier to understand and cheaper to modify without changing its observable behavior based on code, the refactoring mechanism is used to discover the clone detection. The proposed algorithm insists semantic relevance...
متن کاملChapter 2 Duplicate Record Detection Using Anfis
The problem of duplicate detection is to find out whether the same real-world object is represented by two or more distinct entries in the database. Duplicate detection is otherwise known as Record linkage or record matching. It is a greatly researched topic and is of vital importance in fields such as master data management, data warehousing and ETL (Extraction, Transformation and Loading), cu...
متن کاملDuplicate code detection using anti-unification
This paper describes a new algorithm for finding software clones. It is conceptually independent of the source language of the analyzed programs, working at the level of abstract syntax trees. The algorithm considers that two sequences of statements form a clone if one of them can be obtained from the other by replacing some subtrees. To our knowledge this notion was not previously employed in ...
متن کاملUsing Acoustic Diarization for Duplicate Detection
The following article describes the use of an acoustic diarization engine for duplicate detection on broadcast news. Diarization is typically used to partition audio into speaker homogeneous regions, or in other words, to determine “who spoke when.” In this setting, however, we use diarization to segment the recordings and group the segments into homogeneous clusters. Diarization is performed b...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Engineering & Technology
سال: 2018
ISSN: 2227-524X
DOI: 10.14419/ijet.v7i1.9.9757